Tuning Parallel I/O on Blue Waters for Writing 10 Trillion Particles

نویسندگان

  • Surendra Byna
  • Robert Sisneros
  • Kalyana Chadalavada
  • Quincey Koziol
چکیده

Large-scale simulations running on hundreds of thousands of processors produce hundreds of terabytes of data that need to be written to files for analysis. One such application is VPIC code that simulates plasma behavior such as magnetic reconnection and turbulence in solar weather. The number of particles VPIC simulates is in the range of trillions and the size of data files to store is in the range of hundreds of terabytes. To test and optimize parallel I/O performance at this scale on Blue Waters, we used the I/O kernel extracted from a VPIC magnetic reconnection simulation. Blue Waters is a supercomputer at National Center for Supercomputing Applications (NCSA) that contains Cray XE6 and XK7 nodes with Lustre parallel file systems. In this paper, we will present optimizations used in tuning the VPIC-IO kernel to write a 5TB file with 5120 MPI processes and a 290TB file with 300,000 MPI processes.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Recent Progress in Tuning Performance of Large-scale I/O with Parallel HDF5

Large-scale scientific simulations running on hundreds of thousands of cores produce massive amounts of data that often needs to be stored in files. Analysis applications run on thousands of cores to access data files in order to extract useful information. Both, simulation and analysis codes, require highlevel I/O libraries that offer superior data access performance for writing and reading da...

متن کامل

Massively Parallel I/O for Partitioned Solver Systems

This paper investigates approaches for massively parallel partitioned solver systems. Typically, such systems have synchronized “loops” and will write data in a well defined block I/O format consisting of a header and data portion. Our target use for such an parallel I/O subsystem is checkpoint-restart where writing is by far the most common operation and reading typically only happens during e...

متن کامل

Many-Task Computing and Blue Waters

This report discusses many-task computing (MTC), both generically and in the context of the proposed Blue Waters systems. Blue Waters is planned to be the largest supercomputer funded by NSF when it begins production use in 2011–2012 at NCSA. The aim of this report is to inform the Blue Waters project about MTC, including understanding aspects of MTC applications that can be used to characteriz...

متن کامل

LIOProf: Exposing Lustre File System Behavior for I/O Middleware

As parallel I/O subsystem in large-scale supercomputers is becoming complex due to multiple levels of software libraries, hardware layers, and various I/O patterns, detecting performance bottlenecks is a critical requirement. While there exist a few tools to characterize application I/O, robust analysis of file system behavior and associating file-system feedback with application I/O patterns a...

متن کامل

Parallel Volume Rendering on the IBM Blue Gene/P

Parallel ray casting volume rendering is implemented and tested on an IBM Blue Gene distributed memory parallel architecture. Data are presented from experiments under a number of different conditions, including dataset size, number of processors, low and high quality rendering, offline storage of results, and streaming of images for remote display. Performance is divided into three main sectio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015